Efficient parallelization for 3d-3v sparse grid Particle-In-Cell: Shared memory architectures
Authors
Abstract
Particle-In-Cell (PIC) schemes are among the most broadly used numerical methods in the kinetic simulation of plasmas. The contribution of the present paper is dedicated to the introduction of novel parallelization strategies specific to shared memory architectures, tailored for implementations using sparse grid reconstructions. These strategies exploit the different parallelisms genuine to Sparse-PIC and obtain a speed-up exceeding 100 on 128 cores in a strong scaling study of a 3d-3v simulation. On top of that, substantial gains (two orders of magnitude) are introduced in the sequential as well as the parallel computation of the electric field thanks to a hierarchization procedure. It consists of decomposing the information carried by the grids onto hierarchical basis functions, entailing a significantly reduced number of operations.
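For readers unfamiliar with the term, the following is a minimal one-dimensional sketch of what hierarchization means, assuming piecewise-linear hat functions on a dyadic grid with boundary points included. It is an illustration of the general sparse grid idea only, not the paper's 3d-3v implementation; the function names `hierarchize_1d` and `dehierarchize_1d` are hypothetical.

```python
import numpy as np


def hierarchize_1d(nodal):
    """Convert nodal values to hierarchical surpluses on a dyadic 1D grid.

    `nodal` holds values at x_i = i / 2**n for i = 0..2**n (boundaries included).
    Assumption: piecewise-linear hat basis; boundary values are left unchanged.
    """
    u = np.asarray(nodal, dtype=float).copy()
    n_intervals = u.size - 1
    level = n_intervals.bit_length() - 1
    if n_intervals < 1 or 2**level != n_intervals:
        raise ValueError("expected 2**n + 1 grid values")
    # Sweep from the finest level to the coarsest, so the two neighbours read
    # at each step still hold nodal (not yet hierarchized) values.
    for l in range(level, 0, -1):
        h = 2 ** (level - l)                    # index distance to coarser neighbours
        idx = np.arange(h, n_intervals, 2 * h)  # points first appearing at level l
        u[idx] -= 0.5 * (u[idx - h] + u[idx + h])
    return u


def dehierarchize_1d(surplus):
    """Inverse transform: rebuild nodal values from hierarchical surpluses."""
    u = np.asarray(surplus, dtype=float).copy()
    n_intervals = u.size - 1
    level = n_intervals.bit_length() - 1
    # Sweep from coarse to fine, so neighbours are already nodal values.
    for l in range(1, level + 1):
        h = 2 ** (level - l)
        idx = np.arange(h, n_intervals, 2 * h)
        u[idx] += 0.5 * (u[idx - h] + u[idx + h])
    return u


if __name__ == "__main__":
    x = np.linspace(0.0, 1.0, 9)   # level-3 grid, 9 points
    f = x * (1.0 - x)              # smooth test function, zero on the boundary
    alpha = hierarchize_1d(f)
    print(alpha)                   # surpluses shrink on finer levels
    print(np.allclose(dehierarchize_1d(alpha), f))
```

Each surplus measures how far a grid value departs from the linear interpolant of its two coarser neighbours, so for smooth data the fine-level coefficients are small. Both sweeps touch every grid point once per transform, which hints at why working in a hierarchical basis can reduce operation counts.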
Similar resources
Efficient Parallelization of Unstructured Reductions on Shared Memory Parallel Architectures
This paper presents a new parallelization method for an efficient implementation of unstructured array reductions on shared memory parallel machines with OpenMP. This method is strongly related to parallelization techniques for irregular reductions on distributed memory machines as employed in the context of High Performance Fortran. By exploiting data locality, synchronization is minimized with...
Comparison of Parallelization Frameworks for Shared Memory Multi-Core Architectures
The development of standard multi-core processors has changed in recent years, moving from bigger, more complex, and faster cores to putting several simpler cores onto one chip. This has also changed the way programs are written in order to leverage the processing power of multiple cores of the same processor. In the beginning, programmers had to divide and distribute the work by hand to the avail...
Efficient Parallelization of Unstructured Reductions on Shared Memory Parallel Architectures
This paper presents a new parallelization method for an efficient implementation of unstructured array reductions on shared memory parallel machines with OpenMP. This method is strongly related to parallelization techniques for irregular reductions on distributed memory machines as employed in the context of High Performance Fortran. By exploiting data locality, synchronization is minimized witho...
Efficient and Scalable Algorithms for Smoothed Particle Hydrodynamics on Hybrid Shared/Distributed-Memory Architectures
This paper describes a new fast and implicitly parallel approach to neighbour-finding in multi-resolution Smoothed Particle Hydrodynamics (SPH) simulations. This new approach is based on hierarchical cell decompositions and sorted interactions, within a task-based formulation. It is shown to be faster than traditional tree-based codes, and to scale better than domain decomposition-based approac...
Shared Memory Parallelization of Data
With the availability of large datasets in application areas like bioinformatics, medical informatics, scientific data analysis, financial analysis, telecommunications, retailing, and marketing, it is becoming increasingly important to execute data mining tasks in parallel. At the same time, technological advances have made shared memory parallel machines commonly available to organizations and...
Journal
Journal title: Journal of Computational Physics
Year: 2023
ISSN: 1090-2716, 0021-9991
DOI: https://doi.org/10.1016/j.jcp.2023.112022